Optimal distance separating halfspace

Authors

  • Frank Plastria
  • Emilio Carrizosa
Abstract

One recently proposed criterion for separating two datasets in discriminant analysis is to use a hyperplane that minimises the sum of distances to it from all the misclassified data points. Here all distances are measured by some fixed norm, and a point is misclassified when it lies on the wrong side of the hyperplane, or rather in the wrong halfspace. In this paper we study the problem of determining such an optimal halfspace. In dimension d, we prove that there always exists an optimal separating halfspace passing through d affinely independent data points. This directly shows that the problem is polynomially solvable in fixed dimension by an algorithm of O(n^{d+1}), since each of the O(n^d) candidate hyperplanes can be evaluated in O(n) time. If a different norm or gauge is used for each dataset to measure distances to the hyperplane, or if all distances are measured by a fixed (asymmetric) gauge, then one can still show that there always exists an optimal separating halfspace passing through d − 1 affinely independent data points. The one-dimensional problem is extremely easy to solve: it suffices to find a balancing separating point, i.e. one yielding an equal number (or weight) of misclassified points for each dataset. It also follows that in any dimension any optimal separating halfspace always balances the misclassified points, where the balancing criterion now takes the shape of the gauges used into account.
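
The enumeration argument in the abstract can be illustrated with a small sketch. It assumes dimension d = 2 and the Euclidean norm, and simply tries every line through two distinct data points in both orientations. The function names `misclassification_cost` and `best_halfplane` are hypothetical and chosen for this illustration; this is a brute-force reading of the existence result, not the authors' own implementation.

```python
from itertools import combinations
import math


def misclassification_cost(w, b, A, B):
    """Sum of Euclidean distances to the line w.x = b over misclassified points.

    Assumed convention: points of A belong in w.x <= b, points of B in w.x >= b.
    Points lying on the line are at distance zero and add nothing.
    """
    norm = math.hypot(w[0], w[1])
    total = 0.0
    for x, y in A:
        total += max(0.0, w[0] * x + w[1] * y - b)
    for x, y in B:
        total += max(0.0, b - (w[0] * x + w[1] * y))
    return total / norm


def best_halfplane(A, B):
    """Try every line through two distinct data points, in both orientations.

    Returns (cost, w, b) for the cheapest candidate found, or None when
    there are fewer than two distinct points.
    """
    points = list(A) + list(B)
    best = None
    for (x1, y1), (x2, y2) in combinations(points, 2):
        nx, ny = y2 - y1, x1 - x2  # a normal to the line through both points
        if nx == 0 and ny == 0:
            continue  # coincident points are not affinely independent
        for sign in (1, -1):
            w = (sign * nx, sign * ny)
            b = w[0] * x1 + w[1] * y1
            cost = misclassification_cost(w, b, A, B)
            if best is None or cost < best[0]:
                best = (cost, w, b)
    return best


if __name__ == "__main__":
    A = [(0, 0), (0, 2), (3, 1)]
    B = [(2, 0), (2, 2), (1, 1)]
    print(best_halfplane(A, B))
```

With n points there are O(n^2) candidate lines and each is scored in O(n) time, which matches the O(n^{d+1}) bound for d = 2. A norm other than the Euclidean one would require dividing by the corresponding dual norm of `w` instead of `math.hypot`.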

Similar resources

Optimal Expected-Distance Separating Halfspace

One recently proposed criterion to separate two datasets in discriminant analysis, is to use a hyperplane which minimises the sum of distances to it from all the misclassified data points. Here all distances are supposed to be measured by way of some fixed norm, while misclassification means lying on the wrong side of the hyperplane, or rather in the wrong halfspace. In this paper we study the ...

Unified Distance Formulas for Halfspace Fog

In many real-time rendering applications, it is necessary to model a fog volume that is bounded by a single plane but is otherwise infinite in extent. This paper presents unified formulas that provide the correct distance traveled through a fog halfspace for all possible camera and surface point locations. Such formulas effectively remove the need to code for multiple cases separately, thereby ...

An optimization problem on the sphere

We prove existence and uniqueness of the minimizer for the average geodesic distance to the points of a geodesically convex set on the sphere. This implies a corresponding existence and uniqueness result for an optimal algorithm for halfspace learning, when data and target functions are drawn from the uniform distribution.

Optimal halfspace range reporting in three dimensions

We give the first optimal solution to a standard problem in computational geometry: three-dimensional halfspace range reporting. We show that n points in 3-d can be stored in a linear-space data structure so that all k points inside a query halfspace can be reported in O(log n + k) time. The data structure can be built in O(n log n) expected time. The previous methods with optimal query time req...

Approximate range searching in higher dimension

Applying standard dimensionality reduction techniques, we show how to perform approximate range searching in higher dimension while avoiding the curse of dimensionality. Given points in a unit ball in , an approximate halfspace range query counts (or reports) the points in a query halfspace; the qualifier “approximate” indicates that points within distance of the boundary of the halfspace might...

Publication date: 2002